# Inference Acceleration
## mera-mix-4x7B
**Author:** meraGPT · **License:** Apache-2.0 · **Tags:** Large Language Model, Transformers

mera-mix-4x7B is a Mixture-of-Experts (MoE) model at half the scale of Mixtral-8x7B, with comparable performance and faster inference. A minimal loading sketch follows below.
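As a usage sketch, the model can be loaded through the Hugging Face Transformers API it is tagged with. The repo id `meraGPT/mera-mix-4x7B` is an assumption based on the author and model name in this listing; verify the exact id on the hub before use.

```python
# Minimal sketch: loading mera-mix-4x7B with the Transformers API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meraGPT/mera-mix-4x7B"  # assumed repo id; confirm on the hub
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision to fit the 4x7B MoE in memory
    device_map="auto",           # spread layers across available devices
)

prompt = "Explain mixture-of-experts routing in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```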
## ProSparse-LLaMA-2-7B
**Author:** SparseLLM · **Tags:** Large Language Model, Transformers, English

A large language model based on LLaMA-2-7B with activation sparsification: the ProSparse method achieves high activation sparsity (89.32%) while maintaining the original model's performance. A sketch illustrating what this sparsity figure measures follows below.
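To make the 89.32% figure concrete, the sketch below shows what "activation sparsity" measures: the fraction of exactly-zero outputs a ReLU feed-forward block emits, which is what lets sparse kernels skip work at inference time. The module and its dimensions are illustrative placeholders, not the ProSparse implementation.

```python
# Illustrative sketch (not the ProSparse training code): measuring the
# activation sparsity of a ReLU feed-forward block.
import torch
import torch.nn as nn

class ReLUFFN(nn.Module):
    """Feed-forward block with ReLU, the activation ProSparse sparsifies."""
    def __init__(self, d_model: int = 4096, d_ff: int = 11008):
        super().__init__()
        self.up = nn.Linear(d_model, d_ff)
        self.down = nn.Linear(d_ff, d_model)
        self.act = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        hidden = self.act(self.up(x))
        # Activation sparsity: share of exactly-zero entries after ReLU.
        self.last_sparsity = (hidden == 0).float().mean().item()
        return self.down(hidden)

ffn = ReLUFFN()
x = torch.randn(1, 16, 4096)  # (batch, sequence, hidden) dummy input
_ = ffn(x)
print(f"activation sparsity: {ffn.last_sparsity:.2%}")
# ~50% here for random inputs; ProSparse reports 89.32% on the trained model.
```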